Add Backtrace Screen #1270

MrCirdo · 2023-07-19T16:18:57Z

Hello everyone!

This is my first big pull request 😃

The goal of this PR is to add a backtrace screen for process or thread.
Here's what it looks like for a thread :

And for a process:

Behind, I use the tool called eu-stack provided by elf-utils.
The standard output is parsed and printed to the screen.
Currently, I have implemented only the Refresh button. And my world is inspired by TraceScreen and OpenFilesScreen.

I still have some work to do before my work is ready (Formatting, bug fixes, ...).
Currently, this is more of a demonstration than a cool feature 😄 .

What do you think about my work? Is it a feature that can be added?

BenBE

Thank you very much for your PR. The feature itself looks interesting and fits quite nicely with existing functionality.

But, the use of external tools is a bit problematic and is best to be avoided. For the case of retrieving stack traces, there are several libraries available, that might be worth a look (e.g. libunwind). Also I'm not sure if the code as-is would properly run on e.g. FreeBSD or Darwin. Thus I strongly prefer a solution that allows to split out these platform-dependent parts where necessary (in the case of lsof, things are portable enough across all our target platforms so it's not an issue there; not sure about eu-stack though).

While reading through your PR I noticed that this seems to handle debug information. As such, it would be nice to have the module, source file and line be available separately (where available). Also the setting of hiding path names should be respected for module filenames in order to be consistent with the rest of the UI. Taking the highlight basename setting into account would be nice too.

Another code refactoring task is related to a recent addition of the generalized columns code that @natoscott recently worked on. Please have a look there if you like.

Also there's a few further notes regarding the code which you can find below. Please feel free to rebase the fixes to those issues directly into the existing commits as you see fit.

If you need further assistance feel free to ask.

Action.c

BacktraceScreen.c

BacktraceScreen.h

MrCirdo · 2023-07-20T16:04:05Z

Thank you for your reactivity and your review.

I agree with you when you say eu-stack is not a good idea. I only used it to make a quick demo of my idea.
I'm also unsure if eu-stack is supported on the BSD platforms.

Moreover, I think also the library unwind is more appropriate. I just have to deal with DWARF information.
I will probably close all suggestions regarding the execution/parsing of eu-stack.

I saw very quickly the work of @natoscott and I will wait until his work is finished.

BacktraceScreen.c

BenBE · 2023-07-20T21:34:47Z

Thank you for your reactivity and your review.

You're welcome.

I agree with you when you say eu-stack is not a good idea. I only used it to make a quick demo of my idea. I'm also unsure if eu-stack is supported on the BSD platforms.

There seems to be some patches re FreeBSD, but I'd not actually call this support … ;-)

That's with Darwin aside entirely … ;-)

Moreover, I think also the library unwind is more appropriate. I just have to deal with DWARF information. I will probably close all suggestions regarding the execution/parsing of eu-stack.

Sure, go ahead. Maybe check for similar places elsewhere in case something similar was missed in other places of this PR.

I saw very quickly the work of @natoscott and I will wait until his work is finished.

If you prepare your PR to anticipate that change, e.g. by preparing the data structures in a way that makes this move easy, you could even prepare things now. Given the remaining aspects in that other PR will take some time to sort out, don't feel obliged to wait for those changes to land. If the structures introduced in this PR are clean enough there's not too much work later to move things over to this new interface. Most of the work actually has already been done.

Explorer09 · 2023-09-05T16:07:07Z

Please don't merge the htop-dev:main branch. Rebase it instead. You know what git rebase is, don't you?

MrCirdo · 2023-09-05T20:02:32Z

The flake commit is just for me. I will remove it at the end.
I don't start my work. However, I keep updating my branch. So it's not ready to review.

MrCirdo · 2023-11-16T18:33:09Z

Hey,

I didn't have a lot of time recently for this PR. But I have some questions.

If you prepare your PR to anticipate that change, e.g. by preparing the data structures in a way that makes this move easy, you could even prepare things now. Given the remaining aspects in that other PR will take some time to sort out, don't feel obliged to wait for those changes to land. If the structures introduced in this PR are clean enough there's not too much work later to move things over to this new interface. Most of the work actually has already been done.

I'm looking for Row workflow. I don't know if it's a good idea to use it. Row_display uses the object Settings to get fields. Does it mean I have to make my own Settings object? With all process settings (like highlightThreads)? And do I have to make my ScreenSettings? Is it okay?

BenBE · 2023-11-16T18:45:06Z

I didn't have a lot of time recently for this PR. But I have some questions.

If you prepare your PR to anticipate that change, e.g. by preparing the data structures in a way that makes this move easy, you could even prepare things now. Given the remaining aspects in that other PR will take some time to sort out, don't feel obliged to wait for those changes to land. If the structures introduced in this PR are clean enough there's not too much work later to move things over to this new interface. Most of the work actually has already been done.

I'm looking for Row workflow. I don't know if it's a good idea to use it. Row_display uses the object Settings to get fields. Does it mean I have to make my own Settings object? With all process settings (like highlightThreads)? And do I have to make my ScreenSettings? Is it okay?

The Settings and ScreenSettings objects are shared between all tabs. The Settings provides with the global settings, applicable to all screens, while ScreenSettings is applicable to only to the current tab only. As you are basically inheriting the settings from the tab you are opening the backtrace details on, you can just copy the pointer to those settings. These are held in the Screen mostly for convenience and to reduce dependency scope. I.e. if you don't need any of the values from the settings (and none of the functions you're relying on) you could even skip these objects entirely.

MrCirdo · 2023-11-17T15:44:32Z

I didn't have a lot of time recently for this PR. But I have some questions.

If you prepare your PR to anticipate that change, e.g. by preparing the data structures in a way that makes this move easy, you could even prepare things now. Given the remaining aspects in that other PR will take some time to sort out, don't feel obliged to wait for those changes to land. If the structures introduced in this PR are clean enough there's not too much work later to move things over to this new interface. Most of the work actually has already been done.

I'm looking for Row workflow. I don't know if it's a good idea to use it. Row_display uses the object Settings to get fields. Does it mean I have to make my own Settings object? With all process settings (like highlightThreads)? And do I have to make my ScreenSettings? Is it okay?

The Settings and ScreenSettings objects are shared between all tabs. The Settings provides with the global settings, applicable to all screens, while ScreenSettings is applicable to only to the current tab only. As you are basically inheriting the settings from the tab you are opening the backtrace details on, you can just copy the pointer to those settings. These are held in the Screen mostly for convenience and to reduce dependency scope. I.e. if you don't need any of the values from the settings (and none of the functions you're relying on) you could even skip these objects entirely.

Thank you very much for the clarification. I was not sure I would be able to modify it (I mean If my changes would be accepted).

BenBE

Please no custom build system files from e.g. Flake …

BacktraceScreen.c

BacktraceScreen.h

MrCirdo · 2024-01-09T09:29:14Z

Hi @BenBE ,
I completely forgot to say to not review my code. I just updated my branch.
I deeply apologize.

BenBE · 2024-01-09T12:13:42Z

Hi @BenBE , I completely forgot to say to not review my code. I just updated my branch. I deeply apologize.

Don't worry. No need to apologize. The PR is marked as draft anyway, thus the only things I remarked upon was what I noticed immediately. I also ticked off some of the previous comments that no longer apply to the current state of this PR. Basically some maintenance.

NB: This PR would be scheduled for 3.4.x earliest anyway given that 3.3.0 is about to ship anytime soon.

MrCirdo · 2024-03-16T17:35:44Z

Hi,

This PR is ready for review.
I closed all previous conversations.

Currently, I add only the support of Linux.

configure.ac

BacktraceScreen.c

BacktraceScreen.h

configure.ac

BenBE · 2024-03-17T11:00:15Z

Given that this feature will be available on multiple platforms and requires changes in different places I'd suggest to have a top-level feature flag for backtrace support and additional flags for the various libraries (iberty, eu-stack, …) The actual implementation for the stack traces should go in the platform code for each system, leaving the actual screen implementation mostly platform independent.

MrCirdo · 2024-03-19T21:04:26Z

Given that this feature will be available on multiple platforms and requires changes in different places I'd suggest to have a top-level feature flag for backtrace support and additional flags for the various libraries (iberty, eu-stack, …) The actual implementation for the stack traces should go in the platform code for each system, leaving the actual screen implementation mostly platform independent.

Ok I see what you mean, is it a good idea to be inspired by for example Machine_new?

BenBE · 2024-03-19T21:26:29Z

Given that this feature will be available on multiple platforms and requires changes in different places I'd suggest to have a top-level feature flag for backtrace support and additional flags for the various libraries (iberty, eu-stack, …) The actual implementation for the stack traces should go in the platform code for each system, leaving the actual screen implementation mostly platform independent.

Ok I see what you mean, is it a good idea to be inspired by for example Machine_new?

Maybe a better candidate may be the way in which Platform_getNetworkIO and Platform_getProcessLocks are implemented in <platform>/*Platform.[ch] …

MrCirdo · 2024-03-20T20:36:58Z

Given that this feature will be available on multiple platforms and requires changes in different places I'd suggest to have a top-level feature flag for backtrace support and additional flags for the various libraries (iberty, eu-stack, …) The actual implementation for the stack traces should go in the platform code for each system, leaving the actual screen implementation mostly platform independent.

Ok I see what you mean, is it a good idea to be inspired by for example Machine_new?

Maybe a better candidate may be the way in which Platform_getNetworkIO and Platform_getProcessLocks are implemented in <platform>/*Platform.[ch] …

Okay thank you, I'll take a look

BacktraceScreen.c

XUtils.c

MrCirdo · 2024-10-28T17:21:14Z

I apologize for the X * const variable, I didn't pay attention...

BacktraceScreen.h

BacktraceScreen.c

BacktraceScreen.h

BacktraceScreen.c

Co-authored-by: Benny Baumann <[email protected]> Co-authored-by: Kang-Che Sung <[email protected]>

BenBE · 2024-11-06T09:41:09Z

For reference:
https://graphics.stanford.edu/~seander/bithacks.html#IntegerLog10

MrCirdo · 2024-11-07T13:29:40Z

For reference:
https://graphics.stanford.edu/~seander/bithacks.html#IntegerLog10

Thank you for the link, It's a goldmine of information.

Normally, I should have corrected all the comments. If so, my work is done. I hope it's ready to merge!

Explorer09 · 2024-11-07T20:43:46Z

XUtils.c

+
+   size_t res = 0;
+   while (n != 0) {
+      n /= base;


Looks like a naive implementation, but using division is slow. I don't know what kind of performance you and @BenBE are aiming for.

The version in the link I pointed too gives a constant time version.

Given most numbers here should be fairly small, the number of iterations here is kinda small too. Thus unless division is a lot slower than this might still be faster compared to the lookup variant.

Some hard numbers here might be good, maybe when comparing even compare with the float variant as a reference.

But, given how little this function is called (once/twice per process maybe) I don't think it's worth optimizing for one or two CPU cycles here.

@BenBE No, I mean, it's better to use multiplication even when you are optimizing this function for small numbers. You are not converting base anyway and thus you don't need remainders for every division.

I'm here to contribute my version: It generates slightly smaller machine code (for x86). 🙂

https://godbolt.org/z/6aWvcc3z8

size_t countDigit_2(size_t n, size_t base) { assert(base > 1); size_t res = 1; size_t limit = base; while (n >= limit) { res++; if (limit > SIZE_MAX / base) { break; } limit *= base; } return res; }

@Explorer09 Good catch/suggestion. Let's take it a another (favoured) candidate.

Also, your's can be shortened somewhat:

size_t countDigit_2(size_t n, size_t base) { assert(base > 1); size_t res = 1; for (size_t limit = base; n >= limit; limit *= base) { res++; if (limit > SIZE_MAX / base) { break; } } return res; }

The SIZE_MAX / base is a runtime constant, that the compiler will likely calculate once before the loop (if -fmove-loop-invariants is used, which is the default for -O1 and above, but not -Og). This could be made explicit:

size_t countDigit_2(size_t n, size_t base) { assert(base > 1); size_t res = 1; for (size_t limit = base, max_limit = SIZE_MAX / base; n >= limit; limit *= base) { res++; if (limit > max_limit) { break; } } return res; }

In this case, even -O0 and -Og would benefit from avoiding this additional division.

With -fipa-cp the division is moved to compile time, which is the default for -O2, -O3, and -Os.

@BenBE No. My intention is to let the compiler optimize (limit > SIZE_MAX / base) into __builtin_umulll_overflow(limit, base, &res). There would be no need to compute SIZE_MAX / base. Just check the overflow flag in the processor register. Technically no division.

Ah, wasn't aware of that builtin. Thx.

Just to mention I found a missed optimization when working with this countDigits function:
GCC bug report, LLVM bug report

README.md

Explorer09 · 2024-11-07T20:53:21Z

BacktraceScreen.c

+
+static void BacktraceFrame_displayHeader(const BacktraceFrame* frame, RichString* out) {
+   const BacktracePanelPrintingHelper* printingHelper = &frame->backtracePanel->printingHelper;
+   assert(printingHelper);


Redundant assert? (Since an & operator is used for retrieving address for printingHelper)

Also, if at all you would check assert(frame);, which is better simply declared as ATTR_NONNULL_ALL because you want the RichString* to be valid too.

BacktraceScreen.c

Explorer09 · 2024-11-07T21:16:34Z

BacktraceScreen.c

+
+   char* basename = frame->backtracePanel->process->procExe + frame->backtracePanel->process->procExeBasenameOffset;
+   char* object = line + objectPathStart;
+   if (!basename || !object) {


Looks like something is wrong with this conditional. Perhaps you mean (!frame->backtracePanel->process->procExe || !line) because you should ignore the character offset when the pointer is null already.

Good catch, but also not quite correct, because object is a pointer addition too and thus should check line.

Alternative:

char* basename = frame->backtracePanel->process->procExe ? frame->backtracePanel->process->procExe + frame->backtracePanel->process->procExeBasenameOffset : NULL; char* object = line ? line + objectPathStart : NULL; if (!basename || !object) {

In which case the current check would work just fine.

You could even do a small macro #define PTR_ADD_OFFSET(ptr, offset) (ptr) ? ((ptr) + (offset)) : NULL.

Explorer09 · 2024-11-08T07:17:24Z

BacktraceScreen.c

+#define BACTRACE_PANEL_HEADER_NUMBER_FRAME "#"
+#define BACTRACE_PANEL_HEADER_ADDRESS "ADDRESS"
+#define BACTRACE_PANEL_HEADER_NAME "NAME"
+#define BACTRACE_PANEL_HEADER_PATH "PATH"


Any reason to have these strings defined as macros?

Unless used in multiple places, I think they should be removed.

I made this decision because it's easier to change the name of column header. Currently only BACTRACE_PANEL_HEADER_NUMBER_FRAME is used twice.
I recognize the name can be shorten.

@MrCirdo The header names in htop are usually stored in an array or a table structure, so it's unlikely you need macro tokens for these.

Explorer09 · 2024-11-08T07:26:30Z

BacktraceScreen.c

+      (int)printingHelper->maxFrameNumLen, BACTRACE_PANEL_HEADER_NUMBER_FRAME,
+      (int)printingHelper->maxAddrLen, BACTRACE_PANEL_HEADER_ADDRESS,
+      (int)maxFunctionNameLength, BACTRACE_PANEL_HEADER_NAME,
+      (int)printingHelper->maxObjPathLen, BACTRACE_PANEL_HEADER_PATH


I feel a little uncomfortable when the length values originally in size_t types have to downcast to int here. This suggests me that the original values should have been in int types already, and not size_t.

I know this question should be up to @BenBE to answer.

Technically, the C standard is at fault here. printf should have defined this to be size_t (or ssize_t for this matter), but this would have broken backward compatibility and far too much code.

Instead, all references to these variables should ensure to stick within the limits of int; consult limits.h for some useful macros. This is in practice no problem, as going beyond INT_MAX or even INT16_MAX will go outside of what curses can handle (pun intended).

Put simply: place some asserts for these limits in strategic places and we should be fine.

Explorer09 · 2024-11-08T07:27:54Z

BacktraceScreen.c

+   char* line = NULL;
+   int objectPathStart = -1;
+   int len = xAsprintf(&line, "%-*d 0x%0*zx %-*s %n%-*s",
+      (int)printingHelper->maxFrameNumLen, frame->index,


How about right aligning the frame number field?

Explorer09 · 2024-11-08T07:38:22Z

linux/Platform.c

+
+addr_space_error:
+   unw_destroy_addr_space(addrSpace);
+#else


This would reduce my pain reading the code (because there are nested preprocessor conditionals above):

Suggested change

#else

#else /* !HAVE_LIBUNWIND_PTRACE */

Explorer09 · 2024-11-12T12:15:54Z

BacktraceScreen.c

+
+      if (frame->functionName) {
+         size_t functionNameLength = strlen(frame->functionName) + digitOfOffsetFrame;
+         printingHelper->maxFuncNameLen = MAXIMUM(functionNameLength, printingHelper->maxFuncNameLen);


What should happen if both frame->functionName and frame->demangleFunctionName are NULL? It seems that digitOfOffsetFrame would be unused. Was that intentional?

Explorer09 · 2024-11-12T12:21:22Z

BacktraceScreen.c

+
+   size_t addressLength = MAX_HEX_ADDR_STR_LEN_32;
+   if (longestAddress > MAX_ADDR_32) {
+      addressLength = MAX_HEX_ADDR_STR_LEN_64;


How about this for more-than-32-bit addresses?

addressLength = strlen("0x") + countDigits(longestAddress, 16);

BenBE added the new feature Completely new feature label Jul 19, 2023

BenBE requested changes Jul 19, 2023

View reviewed changes

BenBE mentioned this pull request Jul 20, 2023

Introduce Row and Table classes for tabular screens beyond top-processes #1243

Closed

Explorer09 reviewed Jul 20, 2023

View reviewed changes

BacktraceScreen.c Outdated Show resolved Hide resolved

Explorer09 reviewed Jul 20, 2023

View reviewed changes

BacktraceScreen.c Outdated Show resolved Hide resolved

Explorer09 reviewed Jul 20, 2023

View reviewed changes

BacktraceScreen.c Outdated Show resolved Hide resolved

MrCirdo force-pushed the main branch from 0b0d5f9 to f5e3801 Compare September 5, 2023 20:01

MrCirdo force-pushed the main branch from f5e3801 to 82f41e3 Compare January 8, 2024 20:35

BenBE requested changes Jan 9, 2024

View reviewed changes

BacktraceScreen.c Outdated Show resolved Hide resolved

BacktraceScreen.c Outdated Show resolved Hide resolved

BacktraceScreen.c Show resolved Hide resolved

BacktraceScreen.c Show resolved Hide resolved

BacktraceScreen.h Show resolved Hide resolved

MrCirdo force-pushed the main branch from 82f41e3 to cfe5300 Compare March 16, 2024 17:33

MrCirdo marked this pull request as ready for review March 16, 2024 17:35

Explorer09 reviewed Mar 16, 2024

View reviewed changes

configure.ac Outdated Show resolved Hide resolved

Explorer09 reviewed Mar 16, 2024

View reviewed changes

BacktraceScreen.c Outdated Show resolved Hide resolved

MrCirdo force-pushed the main branch 2 times, most recently from c53af06 to 792bdba Compare March 16, 2024 20:46

Explorer09 reviewed Mar 17, 2024

View reviewed changes

BacktraceScreen.c Outdated Show resolved Hide resolved

BacktraceScreen.c Outdated Show resolved Hide resolved

BacktraceScreen.h Outdated Show resolved Hide resolved

configure.ac Outdated Show resolved Hide resolved

Explorer09 reviewed Oct 26, 2024

View reviewed changes

BacktraceScreen.c Outdated Show resolved Hide resolved

Explorer09 reviewed Oct 26, 2024

View reviewed changes

BacktraceScreen.c Outdated Show resolved Hide resolved

Explorer09 reviewed Oct 26, 2024

View reviewed changes

XUtils.c Show resolved Hide resolved

MrCirdo force-pushed the main branch from 9f599f1 to 507d351 Compare October 28, 2024 17:25

MrCirdo commented Oct 28, 2024

View reviewed changes

BacktraceScreen.h Outdated Show resolved Hide resolved

MrCirdo force-pushed the main branch from 507d351 to f0fddea Compare October 28, 2024 17:29

Explorer09 reviewed Oct 28, 2024

View reviewed changes

BacktraceScreen.c Show resolved Hide resolved

Explorer09 reviewed Oct 28, 2024

View reviewed changes

BacktraceScreen.h Show resolved Hide resolved

Explorer09 reviewed Oct 28, 2024

View reviewed changes

BacktraceScreen.c Outdated Show resolved Hide resolved

MrCirdo force-pushed the main branch from f0fddea to f094b7d Compare October 30, 2024 17:06

MrCirdo and others added 4 commits November 6, 2024 10:16

Add a function that counts digits

b65dba1

Co-authored-by: Benny Baumann <[email protected]> Co-authored-by: Kang-Che Sung <[email protected]>

Add backtrace screen

81a167b

Co-authored-by: Benny Baumann <[email protected]> Co-authored-by: Kang-Che Sung <[email protected]>

Add backtrace Linux dependencies for the CI.

a4cf62e

Co-authored-by: Benny Baumann <[email protected]> Co-authored-by: Kang-Che Sung <[email protected]>

Update the man page for backtrace screen

ec90661

Co-authored-by: Benny Baumann <[email protected]> Co-authored-by: Kang-Che Sung <[email protected]>

MrCirdo force-pushed the main branch from f094b7d to ec90661 Compare November 6, 2024 09:17

Explorer09 reviewed Nov 7, 2024

View reviewed changes

README.md Show resolved Hide resolved

Explorer09 reviewed Nov 7, 2024

View reviewed changes

BacktraceScreen.c Show resolved Hide resolved

Explorer09 reviewed Nov 7, 2024

View reviewed changes

Explorer09 reviewed Nov 8, 2024

View reviewed changes

Explorer09 mentioned this pull request Nov 11, 2024

Introduce countDigits() function for PID and UID field widths #1561

Open

Explorer09 reviewed Nov 12, 2024

View reviewed changes

Add Backtrace Screen #1270

Are you sure you want to change the base?

Add Backtrace Screen #1270

Conversation

MrCirdo commented Jul 19, 2023

BenBE left a comment • edited Loading

Choose a reason for hiding this comment

MrCirdo commented Jul 20, 2023

BenBE commented Jul 20, 2023

Explorer09 commented Sep 5, 2023

MrCirdo commented Sep 5, 2023 • edited Loading

MrCirdo commented Nov 16, 2023 • edited Loading

BenBE commented Nov 16, 2023

MrCirdo commented Nov 17, 2023

BenBE left a comment

Choose a reason for hiding this comment

MrCirdo commented Jan 9, 2024

BenBE commented Jan 9, 2024

MrCirdo commented Mar 16, 2024 • edited Loading

BenBE commented Mar 17, 2024

MrCirdo commented Mar 19, 2024

BenBE commented Mar 19, 2024

MrCirdo commented Mar 20, 2024

MrCirdo commented Oct 28, 2024

BenBE commented Nov 6, 2024

MrCirdo commented Nov 7, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

BenBE left a comment •

edited

Loading

MrCirdo commented Sep 5, 2023 •

edited

Loading

MrCirdo commented Nov 16, 2023 •

edited

Loading

MrCirdo commented Mar 16, 2024 •

edited

Loading